The Long-Short Story of Movie Description
نویسندگان
چکیده
Generating descriptions for videos has many applications including assisting blind people and human-robot interaction. The recent advances in image captioning as well as the release of large-scale movie description datasets such as MPII-MD [28] and M-VAD [31] allow to study this task in more depth. Many of the proposed methods for image captioning rely on pre-trained object classifier CNNs and Long ShortTerm Memory recurrent networks (LSTMs) for generating descriptions. While image description focuses on objects, we argue that it is important to distinguish verbs, objects, and places in the setting of movie description. In this work we show how to learn robust visual classifiers from the weak annotations of the sentence descriptions. Based on these classifiers we generate a description using an LSTM. We explore different design choices to build and train the LSTM and achieve the best performance to date on the challenging MPII-MD and M-VAD datasets. We compare and analyze our approach and prior work along various dimensions to better understand the key challenges of the movie description task.
منابع مشابه
War, Trauma, Memory in Selected Short Stories of Fire and Forget Edited by Roy Scranton and Matt Ghalagher and A Vital Killing by Ahmad Dehghan
This article is a comparative study of similar experiences in the American short story collection, Fire and Forget: Short Stories from the Long War edited by Roy Scranton and Matt Ghalagher and the Persian short story collection, A Vital Killing by Ahmad Dehghan as they belong to two different languages, different cultures, and different worldviews. It is an exploration of an overwhelmed psycho...
متن کاملNader and Simin—A Separation: A Deconstructive Reading
Nader and Simin—A Separation is the first Iranian movie which won manyinternational awards as well as admiration from critics and the public. Many reviewsby critics, however, have revolved around problems in spousal relationships ofcouples in different social classes of Iran. Through highlighting the self-evidentdismantling elements and unreliable readings in the acting, directing, and even the...
متن کاملA psychological analysis of the movie Under the Smokey Roof (2017) based on the family therapy theories
Movies are considered an effective educational resource for students, especially those who study Psychology. The purpose of this study is to analyze the movie "Under the Smokey Roof" directed by Pouran Derakhshandeh, based on the family therapy theories. This movie shows the story of a family struggling with different social and psychological issues. In this article, a descriptive-analytical me...
متن کاملThe Comparative Effects of Using Electronic Short Story Books and Tradi-tional Printed Texts on EFL Learners’ Reading Comprehension
The purpose of this study was to investigate the comparative effect of using electronic short story books and traditional printed texts on EFL learners’ reading comprehension. For that purpose, ninety female learners ranging in age between fifteen and thirty five sat for the language proficiency test (PET, 2009) as the test of homogeneity and consequently sixty students were selected based on t...
متن کاملThe Study of Stage Description (Didascalia) in Tennessee Williams’ One Act Plays
Stage Description or, as Michael Issacharoff says, Didascalia, is a unique characteristic of Drama which has been part of plays from the beginning of the tradition of playwriting in mankind’s history. Didascalia in modern plays has found a complex function, helping playwrites as a technical tool as well as creating a richer vision of the characters, transfer plot in a betterr way, showing the c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015